Picture for Yibin Wang

Yibin Wang

Unified Personalized Reward Model for Vision Generation

Add code
Feb 02, 2026
Viaarxiv icon

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Add code
Feb 02, 2026
Viaarxiv icon

FairExpand: Individual Fairness on Graphs with Partial Similarity Information

Add code
Dec 20, 2025
Figure 1 for FairExpand: Individual Fairness on Graphs with Partial Similarity Information
Figure 2 for FairExpand: Individual Fairness on Graphs with Partial Similarity Information
Figure 3 for FairExpand: Individual Fairness on Graphs with Partial Similarity Information
Figure 4 for FairExpand: Individual Fairness on Graphs with Partial Similarity Information
Viaarxiv icon

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Add code
Oct 02, 2025
Figure 1 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Figure 2 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Figure 3 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Figure 4 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Viaarxiv icon

Towards Confidential and Efficient LLM Inference with Dual Privacy Protection

Add code
Sep 11, 2025
Viaarxiv icon

An U-Net-Based Deep Neural Network for Cloud Shadow and Sun-Glint Correction of Unmanned Aerial System (UAS) Imagery

Add code
Sep 10, 2025
Viaarxiv icon

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Add code
Aug 28, 2025
Viaarxiv icon

DiCache: Let Diffusion Model Determine Its Own Cache

Add code
Aug 24, 2025
Figure 1 for DiCache: Let Diffusion Model Determine Its Own Cache
Figure 2 for DiCache: Let Diffusion Model Determine Its Own Cache
Figure 3 for DiCache: Let Diffusion Model Determine Its Own Cache
Figure 4 for DiCache: Let Diffusion Model Determine Its Own Cache
Viaarxiv icon

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Add code
Jun 08, 2025
Viaarxiv icon

Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Add code
Jun 05, 2025
Viaarxiv icon